Representations of sources and data: working with exceptions to hierarchy in historical documents
No abstract available
Representing text as data: the analysis of historical sources in XML
In conventional approaches to computer analysis of historical sources, one must represent the data in structured formats that discard content and context. Extensible Markup Language (XML), developed for data transfer on the Internet, permits preservation of the full text of irregular historical sources without sacrificing the ability to conduct systematic analysis. Related querying tools offer most functions of a relational database management system, including data transformation facilities for coding, standardizing, and aggregating nominal data. An XML database permits multiple interpretations of the data because the unit of analysis and the coding schemes are not defined at entry. In a case study of probate inventories, the author demonstrates how structured analysis of domestic interiors can be performed and introduces approaches to studying semistructured data.
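The idea that the unit of analysis and the coding scheme are chosen at query time rather than at entry can be illustrated with a minimal sketch. It is not the author's implementation: the sample inventory, the element and attribute names (inventory, room, item, name) and the CODES table are all hypothetical, and Python's standard xml.etree.ElementTree stands in for the XML querying tools the abstract mentions.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical probate inventory. The markup wraps the running text,
# so the full wording of the source is kept alongside the structure.
SAMPLE = """
<inventory id="inv1">
  <room name="kitchen">In the kitchen: <item>two iron pots</item>, <item>a brass kettle</item> and <item>one table</item>.</room>
  <room name="parlour">In the parlour: <item>a table</item> with <item>six chairs</item>.</room>
</inventory>
""".strip()

# Hypothetical coding scheme, applied at analysis time and not stored in the data.
CODES = {"pots": "cookware", "kettle": "cookware",
         "table": "furniture", "chairs": "furniture"}


def code_item(text):
    """Map an item description to a category by keyword; 'uncoded' if none matches."""
    lowered = text.lower()
    for keyword, category in CODES.items():
        if keyword in lowered:
            return category
    return "uncoded"


def count_by_room(root):
    """Unit of analysis: (room, category) pairs."""
    counts = Counter()
    for room in root.iter("room"):
        for item in room.iter("item"):
            counts[(room.get("name"), code_item(item.text))] += 1
    return counts


def count_by_category(root):
    """Unit of analysis: category across the whole inventory."""
    return Counter(code_item(item.text) for item in root.iter("item"))


root = ET.fromstring(SAMPLE)
by_room = count_by_room(root)
by_category = count_by_category(root)

# The original sentence of the first room is still recoverable in full.
full_text = "".join(root.find("room").itertext())
```

The same document answers both questions without re-entry: changing CODES or grouping by a different element gives a different interpretation of unchanged source text.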